Conversation

@CUHKSZzxy CUHKSZzxy (Collaborator) commented Jul 4, 2025

This adds expert distribution dumping, following dlBLAS. It is not compatible with CUDA graph, so the server must be launched with --eager-mode. Example launch command:

LMDEPLOY_DUMP_EXPERT_DISTRIBUTION=1 \
LMDEPLOY_EXPERT_DUMP_DIR="your_expert_distribution_dir" \
LMDEPLOY_DP_MASTER_ADDR=0.0.0.0 \
LMDEPLOY_DP_MASTER_PORT=29555 \
lmdeploy serve api_server \
    Qwen/Qwen3-235B-A22B-FP8 \
    --backend pytorch \
    --tp 1 \
    --dp 4 \
    --ep 4 \
    --proxy-url http://0.0.0.0:8001 \
    --nnodes 1 \
    --node-rank 0 \
    --eager-mode \
    --log-level INFO
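
Once the server has produced dumps under LMDEPLOY_EXPERT_DUMP_DIR, the distribution can be inspected offline. The sketch below is only illustrative and not part of this PR: it assumes the dump directory contains torch-saved tensors of per-expert token counts (the actual file naming and format are not specified here), and shows how one might aggregate them to spot routing imbalance.

import glob
import os

import torch  # assumption: dumps are tensors saved with torch.save

# Same directory passed above via LMDEPLOY_EXPERT_DUMP_DIR.
dump_dir = os.environ.get("LMDEPLOY_EXPERT_DUMP_DIR", "your_expert_distribution_dir")

# Hypothetical layout: one *.pt file per dump, each holding a
# [num_layers, num_experts] tensor of per-expert token counts.
totals = None
for path in sorted(glob.glob(os.path.join(dump_dir, "*.pt"))):
    counts = torch.load(path, map_location="cpu")
    totals = counts if totals is None else totals + counts

if totals is None:
    print(f"no dump files found under {dump_dir}")
else:
    # Report the hottest expert per layer to highlight routing imbalance.
    for layer_id, layer_counts in enumerate(totals):
        top = int(layer_counts.argmax())
        print(f"layer {layer_id}: expert {top} received "
              f"{int(layer_counts[top])} of {int(layer_counts.sum())} tokens")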

@CUHKSZzxy CUHKSZzxy marked this pull request as ready for review July 4, 2025 04:16